Testing the Maximum Entropy Principle for Information Retrieval

نویسندگان

  • Paul B. Kantor
  • Jung Jin Lee
چکیده

A probabilistic information retrieval method using the Maxfrom Kantor, 1994), or (b) useful for information retrieval, imum Entropy Principle (MEP) was proposed by Cooper even if it is not true. In the present article, we examine these and Huizinga (1982). Several refinements of the MEP for questions using the TREC5 (Harman, 1996, in press) biinformation retrieval have been proposed by Kantor and corpus. In particular, we have concentrated on the 50 routing Lee (1986, 1991), but the MEP has not been evaluated in topics, and on documents which have either been judged as any large database. This article examines the MEP retrieval method using the TREC5 database. The MEP is evaluated to relevance, or are contained in 0.1%, 1%, and 10% random by several tests and compared with a ‘‘naive ordering samples of the TREC5 documents. method’’ and ‘‘lexicographic ordering method.’’ The MEP There are several ways in which the MEP might be does not provide any startling improvement, and it works found to apply. The Strong MEP asserts that the actual reasonably well only in the case of a small number of keys distribution of relevant and non-relevant documents and a relatively small collection. across the collection corresponds, as a probability distri-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum Entropy Analysis for G/G/1 Queuing System (TECHNICAL NOTE)

This paper provides steady state queue-size distribution for a G/G/1 queue by using principle of maximum entropy. For this purpose we have used average queue length and normalizing condition as constraints to derive queue-size distribution. Our results give good approximation as demonstrated by taking a numerical illustration. In particular case when square coefficient of variation of inter-arr...

متن کامل

Hierarchical Geometry Verification via Maximum Entropy Saliency in Image Retrieval

We propose a new geometric verification method in image retrieval—Hierarchical Geometry Verification via Maximum Entropy Saliency (HGV)—which aims at filtering the redundant matches and remaining the information of retrieval target in images which is partly out of the salient regions with hierarchical saliency and also fully exploring the geometric context of all visual words in images. First o...

متن کامل

Generalized Phase Retrieval Algorithm based on Information Measures

An iterative phase retrieval algorithm based on the maximum entropy method (MEM) is presented. Introducing a new generalized information measure, we derive a novel class of algorithms which includes the conventionally used error reduction algorithm and a MEM-type iterative algorithm which is presented for the first time. These different phase retrieval methods are unified on the basis of the fr...

متن کامل

A Note on the Bivariate Maximum Entropy Modeling

Let X=(X1 ,X2 ) be a continuous random vector. Under the assumption that the marginal distributions of X1 and X2 are given, we develop models for vector X when there is partial information about the dependence structure between X1  and X2. The models which are obtained based on well-known Principle of Maximum Entropy are called the maximum entropy (ME) mo...

متن کامل

News-Oriented Keyword Indexing with Maximum Entropy Principle

In our information era, keywords are very useful to information retrieval, text clustering and so on. News is always a domain attracting a large amount of attention. Aiming at news documents' characteristics and the resources available, this paper proposes to use Maximum Entropy (ME) model to conduct automatic keyword indexing. The focus of ME-based keyword indexing is how to obtain all the can...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JASIS

دوره 49  شماره 

صفحات  -

تاریخ انتشار 1998